Goto

Collaborating Authors

 cause-and-effect relationship




Causal Distillation: Transferring Structured Explanations from Large to Compact Language Models

arXiv.org Artificial Intelligence

Large proprietary language models exhibit strong causal reasoning abilities that smaller open-source models struggle to replicate. We introduce a novel framework for distilling causal explanations that transfers causal reasoning skills from a powerful teacher model to a compact open-source model. The key idea is to train the smaller model to develop causal reasoning abilities by generating structured cause-and-effect explanations consistent with those of the teacher model. To evaluate the quality of the student-generated explanations, we introduce a new metric called Causal Explanation Coherence (CEC) to assess the structural and logical consistency of causal reasoning. This metric uses sentence-level semantic alignment to measure how well each part of the generated explanation corresponds to the teacher's reference, capturing both faithfulness and coverage of the underlying causal chain. Our framework and the CEC metric provide a principled foundation for training smaller models to perform robust causal reasoning and for systematically assessing the coherence of explanations in language model outputs.


Emerging categories in scientific explanations

arXiv.org Artificial Intelligence

Clear and effective explanations are essential for human understanding and knowledge dissemination. The scope of scientific research aiming to understand the essence of explanations has recently expanded from the social sciences to include the fields of machine learning and artificial intelligence. Important contributions from social sciences include [18, 17, 22, 13, 5, 11] with works that examine critical aspects such as causality (cause-and-effect relationships), contrast (distinctions between differing scenarios), relevance (applicability of explanations), and truth (accuracy and verifiability of explanations). However, machine learning and natural language processing focus more on operational definitions and on the importance of constructing datasets, as seen in studies by [21, 23, 6]. Since explanations for machine learning decisions must be both impactful and human-like [10, 3, 20, 12, 4], a major challenge lies in developing explanations that emphasize proximal aspects -- details that are immediately relevant, direct and related to the user -- over broad algorithmic processes [21].


Regularized Multi-LLMs Collaboration for Enhanced Score-based Causal Discovery

arXiv.org Artificial Intelligence

As the significance of understanding the cause-and-effect relationships among variables increases in the development of modern systems and algorithms, learning causality from observational data has become a preferred and efficient approach over conducting randomized control trials. However, purely observational data could be insufficient to reconstruct the true causal graph. Consequently, many researchers tried to utilise some form of prior knowledge to improve causal discovery process. In this context, the impressive capabilities of large language models (LLMs) have emerged as a promising alternative to the costly acquisition of prior expert knowledge. In this work, we further explore the potential of using LLMs to enhance causal discovery approaches, particularly focusing on score-based methods, and we propose a general framework to utilise the capacity of not only one but multiple LLMs to augment the discovery process.


Are LLMs Capable of Data-based Statistical and Causal Reasoning? Benchmarking Advanced Quantitative Reasoning with Data

arXiv.org Artificial Intelligence

Quantitative reasoning is a critical skill to analyze data, yet the assessment of such ability remains limited. To address this gap, we introduce the Quantitative Reasoning with Data (QRData) benchmark, aiming to evaluate Large Language Models' capability in statistical and causal reasoning with real-world data. The benchmark comprises a carefully constructed dataset of 411 questions accompanied by data sheets from textbooks, online learning materials, and academic papers. To compare models' quantitative reasoning abilities on data and text, we enrich the benchmark with an auxiliary set of 290 text-only questions, namely QRText. We evaluate natural language reasoning, program-based reasoning, and agent reasoning methods including Chain-of-Thought, Program-of-Thoughts, ReAct, and code interpreter assistants on diverse models. The strongest model GPT-4 achieves an accuracy of 58%, which has much room for improvement. Among open-source models, Deepseek-coder-instruct, a code LLM pretrained on 2T tokens, gets the highest accuracy of 37%. Analysis reveals that models encounter difficulties in data analysis and causal reasoning, and struggle in using causal knowledge and provided data simultaneously. Code and data are in https://github.com/xxxiaol/QRData.


Efficient Causal Discovery for Robotics Applications

arXiv.org Artificial Intelligence

Using robots for automating tasks in environments shared with humans, such as warehouses, shopping centres, or hospitals, requires these robots to comprehend the fundamental physical interactions among nearby agents and objects. Specifically, creating models to represent cause-and-effect relationships among these elements can aid in predicting unforeseen human behaviours and anticipate the outcome of particular robot actions. To be suitable for robots, causal analysis must be both fast and accurate, meeting real-time demands and the limited computational resources typical in most robotics applications. In this paper, we present a practical demonstration of our approach for fast and accurate causal analysis, known as Filtered PCMCI (F-PCMCI), along with a real-world robotics application. The provided application illustrates how our F-PCMCI can accurately and promptly reconstruct the causal model of a human-robot interaction scenario, which can then be leveraged to enhance the quality of the interaction.


The Gap Between Deep Learning and Human Cognitive Abilities - KDnuggets

#artificialintelligence

Hi! I am Bohdan Ponomar, CEO at the AI HOUSE community. We are part of the ecosystem being built up by Roosh technology company. Roosh creates ML/AI projects and invests in innovative ideas in the industry. Our ecosystem also includes Pawa venture studio, Roosh Ventures venture fund, SET University technological university, Reface and ZibraAI startups, and Neurons Lab company. The first lecture of the series was delivered by Yoshua Bengio, professor at the University of Montreal, founder and scientific director of the Quebec Artificial Intelligence Institute, head of the CIFAR Learning in Machines & Brains program, and one of the leading experts in the AI industry.


How can Machine Learning algorithms include better Causality?

#artificialintelligence

Note: This article aims to cover theoretical concepts as well as demonstrate practical hands-on examples given the information available online.


How can Machine Learning algorithms include better Causality?

#artificialintelligence

Note: This article aims to cover theoretical concepts as well as demonstrate practical hands-on examples given the information available online.